Validating Constraints in XML
نویسندگان
چکیده
The role of XML in data exchange is evolving from one of merely conveying the structure of data to one that also conveys its semantics. In particular, several proposals for key and foreign key constraints have recently appeared, and aspects of these proposals have been adopted within XMLSchema. Although several validators for XMLSchema appear to check for keys, relatively little attention has been paid to the general problem of how to check constraints in XML. In this paper, we examine the problem of checking keys in XML documents and describe a native validator based on SAX. The algorithm relies on an indexing technique based on the paths found in key definitions, and can be used for checking the correctness of an entire document (bulk checking) as well as for checking updates as they are made to the document (incremental checking). The asymptotic performance of the algorithm is linear in the size of the document or update. We also discuss how XML keys can be checked in relational representations of XMLdocuments, and compare the performance of our native validator against hand-coded relational constraints. Extrapolating from this experience, we propose how a relational schema can be designed to check XMLSchema key constraints using efficient relational PRIMARY KEY or UNIQUE constraints. Comments University of Pennsylvania Department of Computer and Information Science Technical Report No. MSCIS-02-03. This technical report is available at ScholarlyCommons: http://repository.upenn.edu/cis_reports/31 Validating Constraints in XML Yi Chen, Susan B. Davidson and Yifeng Zheng
منابع مشابه
DTD++ 2.0: Adding support for co-constraints
In this paper we present an evolution of the DTD++ schema language for XML documents. The original DTD++ language provided support for a large and significant subset of XML Schema while maintaining a syntax closely resembling DTDs: thus the expressive power of XML Schema and the readability of DTDs were both supported in a modular architecture that could rely on a number of validating engine fo...
متن کاملAttribute Grammar for XML Integrity Constraint Validation
The main contribution of this paper is a generic grammarware for validating XML integrity constraints. Indeed, we use an attribute grammar to describe XML documents and constraints. We thus explain the main parts of this novel algorithm and we report on experiments showing that our method allows for an effective and efficient validation of XML functional dependencies (XFD).
متن کاملA Grammarware for the Incremental Validation of Integrity Constraints on XML Documents under Multiple Updates
This paper extends our previous work by showing our grammarware as a generic way for validating XML integrity constraints, both from scratch and incrementally during document updates. Indeed, our generic attribute grammar can stand for any XML constraint validator, provided that the constraint is defined with paths. We specify a common definition framework for seven classes of XML integrity con...
متن کاملXML Data Constraint and XINCAML
XML is becoming the de facto standard for data exchange. Because it brings structures and semantics to the contents, it is very important for applications to verify the validity of XML data before further processing. W3C XML Schema language can specify many of the constraints in XML data, but it lacks of the capability of expressing application specific inter-node constraints. Therefore XincaML...
متن کاملVisual Modeling of XML Constraints Based on a New Extensible Constraint Markup Language
With the mature of e-business on the Internet, the eXtensible Markup Language (XML) is rapidly becoming the industry standard for business-to-business (B2B) data integration. While Document Type Definitions (DTDs) and XML Schemas can be used to specify and validate syntactic constraints on XML documents, currently there are no effective languages or tools for specifying and validating semantic ...
متن کامل